Blogs Search Engine Using RSS Syndication and Fuzzy Parameters

نویسندگان

  • Athraa Jasim Mohammed
  • Husniza Husni
چکیده

The rapid development of the internet eventually increases the number of internet users triggering the need for an intelligent search engine that is able to minimize the search on world wide web (WWW) and find relevant information as requested. To overcome the issue of finding relevant information as well as minimizing the search on WWW, this paper proposes a search engine that is specifically designed and built using RSS syndication and fuzzy Parameters to search for information contained in blogs. The blogs search engine consists of three main phases: 1) crawling using RSS feeds algorithm; 2) indexing weblogs algorithm; and 3) searching technique using fuzzy logic. In RSS crawling process, the RSS feeds need to be gathered to extract useful information such as title, links, time published, and description. Next, indexing weblogs uses the links to retrieve the blog sites for text processing and for constructing the indexing database. In order to retrieve such information requested or queried by any user, an interface is provided to enable the blog search based on keyword with associated degree of importance. The density of keyword is then computed from the indexing database. The rank of the pages is computed by using fuzzy weighted average. The experiment resulted in mean average precision of 81.7% of total system performance. Keywords—Rss feeds, blog ssearch engine, fuzzy weighted average, keyword density.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

CS229 Final Project: Clustering News Feeds with Flock

The rise of blogging and RSS (Really Simple Syndication) have created a more personalized way of reading the news, with a much richer diversity of information and perspectives. Unfortunately, though, the rapid growth of these technologies has created a new problem how to handle the vast amount of information now available. Unfortunately, while RSS aggregators have helped to bring all of the inf...

متن کامل

RSS Feed Recommendation

Introduction Really Simple Syndication (RSS) Feeds allows users to access blogs and articles in an easy to read format. It cuts out the overhead of navigating websites for content and allows users to get information more quickly. Currently, the user is in total control of their RSS feeds, adding and deleting feeds according to their tastes. This requires the user to actively search out RSS feed...

متن کامل

Memeta: A Framework for Multi-Relational Analytics on the Blogosphere

The “memeta” project is developing a framework for studying the structure and content of the blogosphere. We are particularly interested in how metadata about blogs can be discovered, extracted and computed, and how this metadata can be modeled, represented and analyzed to provide new blog related services. Weblogs, or blogs, are web sites consisting of dated entries (posts) typically organized...

متن کامل

RSS, OPML and Weblog Ecosystems: A Survey of New Technologies in Internet Publication

With the rapid growth of weblogs (or “blogs”) over the past year, users require a way of rapidly accessing recent content from many different websites. Traditional websites are inadequate for this, as their content and presentation information are inseparably intertwined. This paper describes the development of the RSS (Really Simple Syndication) specifications as a solution for this problem. T...

متن کامل

A Comparing between the impacts of text based indexing and folksonomy on ranking of images search via Google search engine

Background and Aim: The purpose of this study was to compare the impact of text based indexing and folksonomy in image retrieval via Google search engine. Methods: This study used experimental method. The sample is 30 images extracted from the book “Gray anatomy”. The research was carried out in 4 stages; in the first stage, images were uploaded to an “Instagram” account so the images are tagge...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012